A generic algorithm for generating spoken monologues

Authors

  • Esther Klabbers
  • Emiel Krahmer
  • Mariët Theune
Abstract

The defining property of a Concept-to-Speech system is that it combines language and speech generation. Language generation converts the input concepts into natural language, which speech generation subsequently transforms into speech. Potentially, this leads to more ‘natural sounding’ output than a plain Text-to-Speech system can achieve, since the correct placement of pitch accents and intonational boundaries (an important factor contributing to the ‘naturalness’ of the generated speech) is co-determined by syntactic and discourse information, which is typically available in the language generation module. In this paper, a generic algorithm for the generation of coherent spoken monologues, called D2S, is discussed. Language generation is done by a module called LGM, which is based on TAG-like syntactic structures with open slots, combined with conditions that determine when each syntactic structure can be used properly. A speech generation module (SGM) converts the output of the LGM into speech using either phrase concatenation or diphone synthesis.
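The idea of syntactic structures with open slots, each guarded by a condition on when it may be used, can be illustrated with a minimal sketch. This is not the LGM implementation: the `Template` class, the `topic` label, the first-applicable selection rule, and the sample data are all hypothetical, and plain format strings stand in for TAG-like syntactic trees.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Set

Concepts = Dict[str, str]


@dataclass
class Template:
    """A sentence pattern with open slots plus an applicability condition.

    Hypothetical stand-in for a syntactic structure with open slots;
    a real system would hold a syntactic tree, not a flat string.
    """
    pattern: str                                      # open slots written as {name}
    condition: Callable[[Concepts, Set[str]], bool]   # may this template be used now?
    topic: str                                        # assumed label recording what has been said


def generate_monologue(concepts: Concepts, templates: List[Template]) -> List[str]:
    """Keep picking the first unused, applicable template until none applies."""
    said: Set[str] = set()
    sentences: List[str] = []
    while True:
        usable = next(
            (t for t in templates
             if t.topic not in said and t.condition(concepts, said)),
            None,
        )
        if usable is None:
            return sentences
        # Fill the open slots from the input concepts.
        sentences.append(usable.pattern.format(**concepts))
        said.add(usable.topic)


# Illustrative data only. The scorer template is listed first, but its
# condition requires the result to have been mentioned already.
templates = [
    Template("{scorer} scored the winning goal.",
             lambda c, said: "scorer" in c and "result" in said,
             "scorer"),
    Template("The match between {home} and {away} ended {score}.",
             lambda c, said: all(k in c for k in ("home", "away", "score")),
             "result"),
]
concepts = {"home": "Team A", "away": "Team B", "score": "2-1", "scorer": "Player X"}
monologue = generate_monologue(concepts, templates)
```

In this sketch the conditions, not the order of the template list, determine the order of the sentences. A speech stage would then take each sentence, together with whatever syntactic and discourse information the language stage kept, and render it as audio.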

Similar resources

Speech Data Corpus for Verbal Intelligence Estimation

The goal of our research is the development of algorithms for automatic estimation of a person’s verbal intelligence based on the analysis of transcribed spoken utterances. In this paper we present the corpus of German native speakers’ monologues and dialogues about the same topics collected at the University of Ulm, Germany. The monologues were descriptions of two short films; the dialogues we...

Dependency Parsing of Japanese Spoken Monologue Based on Clause Boundaries

Spoken monologues feature greater sentence length and structural complexity than do spoken dialogues. To achieve high parsing performance for spoken monologues, it could prove effective to simplify the structure by dividing a sentence into suitable language units. This paper proposes a method for dependency parsing of Japanese monologues based on sentence segmentation. In this method, the depen...

Discourse Structure in Spoken Language: Studies on Speech Corpora

A better understanding of the intonational characteristics of spoken discourse may lead to new empirical techniques for identifying discourse structure from speech, as well as new algorithms for enhancing the naturalness of synthetic speech. This paper summarizes results of pilot studies that demonstrate reliable correlations of discourse and speech properties, and reports findings on a new cor...

Midcourse Trajectory Shaping for Air and Ballistic Defence Guidance Using Bezier Curves

A near-optimal midcourse trajectory shaping guidance algorithm is proposed for both air and ballistic target engagement mission attributes for a generic long-range interceptor missile. This guidance methodology is based on the maximum final velocity as the objective function and maximum permissible flight altitude as the in-flight state constraint as well as the head-on orientation as the termina...

An Information Structural Approach to Spoken Language Generation

This paper presents an architecture for the generation of spoken monologues with contextually appropriate intonation. A two-tiered information structure representation is used in the high-level content planning and sentence planning stages of generation to produce efficient, coherent speech that makes certain discourse relationships, such as explicit contrasts, appropriately salient. The system is...

Publication date: 1998